The analysis of speech codecs using psychoacoustic measures
Abstract
This paper analyses two narrowband speech codecs, the 4.8 kbit/s FS1016 coder and the 8 kbit/s G.729 coder, using objective psychoacoustic measures. Four measures are used: loudness, sharpness, roughness and tonality. The results indicate that sharpness and roughness are the two major contributing factors to the subjective difference between the two coders.
Similar resources
Improved Noise Weighting in CELP Coding of Speech - Applying the Vorbis Psychoacoustic Model To Speex
One key aspect of the CELP algorithm is that it shapes the coding noise using a simple, yet effective, weighting filter. In this paper, we improve the noise shaping of CELP using a more modern psychoacoustic model. This has the significant advantage of improving the quality of an existing codec without the need to change the bit-stream. More specifically, we improve the Speex CELP codec by usin...
Rate Distortion Analyses and Bounds on Speech Codec Performance
We develop new rate distortion bounds for narrowband and wideband speech coding based on composite source models for speech and perceptual PESQ-MOS/WPESQ distortion measures. It is shown that these new rate distortion bounds do in fact lower bound the performance of important standardized speech codecs, including G.726, G.727, AMR-NB, G.729, G.718, G.722, G.722.1, and AMR-WB. The approach is t...
Psychoacoustic and phoneme identification measures in cochlear-implant and normal-hearing listeners.
The purpose of this study is to identify precise and repeatable measures for assessing cochlear-implant (CI) hearing. The study presents psychoacoustic and phoneme identification measures in CI and normal-hearing (NH) listeners, with correlations between measures examined. Psychoacoustic measures included pitch discrimination tasks using pure tones, harmonic complexes, and tone pips; intensity ...
Analysis of Automatic Speaker Verification Performance over Different Narrowband and Wideband Telephone Channels
Current speaker recognition applications involve the authentication of users by their voices for access to restricted information and privileges. The speech signal is often transmitted to the recognizer through communication channels presenting different transmission characteristics. The aim of this paper is to study the effects of speech bandwidth and coding schemes on speaker verification. We...
Associations and dissociations between psychoacoustic abilities and speech perception in adolescents with severe-to-profound hearing loss.
PURPOSE: To clarify the relationship between psychoacoustic capabilities and speech perception in adolescents with severe-to-profound hearing loss (SPHL). METHOD: Twenty-four adolescents with SPHL and young adults with normal hearing were assessed with psychoacoustic and speech perception tests. The psychoacoustic tests included gap detection (GD), difference limen for frequency, and psychoacou...